Multi-armed bandit - PDFSEARCH.IO - Document Search Engine

Multi-armed bandit
Results: 113

#	Item
31	Multi-Bandit Best Arm Identification V. Gabillon, M. Ghavamzadeh, A. Lazaric & S. Bubeck Sequel Group Meeting, October 21, 2011. An Example Source URL: victorgabillon.nfshost.com Language: English - Date: 2011-10-25 11:18:31 Statistics Machine learning Multi-armed bandit Stochastic optimization Bandit Variance
32	CSStat 260, Fall 2014: Learning in Sequential Decision Problems Lectures: Evans 334. Tuesday/Thursday 2:00-3:30. Instructor: Peter Bartlett http://www.stat.berkeley.edu/~bartlett Source URL: www.stat.berkeley.edu Language: English - Date: 2014-08-28 11:49:09 Mathematical optimization Operations research Mathematical analysis Numerical analysis Convex optimization Stochastic optimization Multi-armed bandit Game theory Linear programming AMPL
33	Deterministic MDPs with Adversarial Rewards and Bandit Feedback Raman Arora TTIC 6045 S. Kenwood Ave. Chicago, IL 60637, USA Source URL: dept.stat.lsa.umich.edu Language: English - Date: 2012-09-12 18:50:24 Markov models Markov processes Stochastic optimization Mathematical optimization Operations research Reinforcement learning Markov decision process Algorithm Multi-armed bandit Dynamic programming Shortest path problem PP
34	Rollout Allocation Strategies for Classification-based Policy Iteration Victor Gabillon Alessandro Lazaric Source URL: victorgabillon.nfshost.com Language: English - Date: 2010-07-01 09:47:14 Mathematics Mathematical analysis Artificial intelligence Backgammon Rollout Markov decision process Multi-armed bandit Reinforcement learning Inverted pendulum Pendulum Prime-counting function Valuation
35	An Empirical Evaluation of Thompson Sampling Lihong Li Yahoo! Research Santa Clara, CA Source URL: papers.nips.cc Language: English - Date: 2014-02-24 03:34:34 Statistics Probability distributions Statistical inference Estimation theory Machine learning Stochastic optimization Bayesian inference Sampling Normal distribution Beta distribution Multi-armed bandit Confidence interval
36	Two-Sided Bandits and the Dating Market Sanmay Das Center for Biological and Computational Learning and Computer Science and Artificial Intelligence Lab Massachusetts Institute of Technology Cambridge, MA 02139 Source URL: faculty.chicagobooth.edu Language: English - Date: 2006-08-08 10:56:19 Matching Combinatorics Game theory Fellows of the Econometric Society Cooperative games Stable marriage problem CC Reinforcement learning Multi-armed bandit Alvin E. Roth Greedy algorithm Algorithm
37	Multi-armed Bandit Problems with History Pannagadatta Shivaswamy and Thorsten Joachims Department of Computer Science, Cornell University, Ithaca NY {pannaga,tj}@cs.cornell.edu 1 Source URL: snowbird.djvuzone.org Language: English - Date: 2011-02-10 15:51:00
38	Multi-armed bandit experiments in the online service economy Steven L. Scott December 20, 2014 Abstract The modern service economy is substantively different from the agricultural and manufacturing economies that precede Source URL: faculty.chicagobooth.edu Language: English - Date: 2015-01-20 12:35:42
39	Adaptive Algorithms for Fixed-Cost Multi-Armed Bandit Problems with Budget Constraints Sandip Sen Anton Ridgway Source URL: swarmlab.unimaas.nl Language: English - Date: 2014-03-15 02:54:22
40	Multi-Armed Bandit Models for 2D Grasp Planning with Uncertainty Michael Laskey, Jeff Mahler, Zoe McCarthy, Florian T. Pokorny, Sachin Patil, Jur van den Berg, Danica Kragic, Pieter Abbeel, Ken Goldberg Source URL: www.ieor.berkeley.edu Language: English - Date: 2015-08-31 02:12:22
